Robust Domain Adaptation for Machine Reading Comprehension

نویسندگان

چکیده

Most domain adaptation methods for machine reading comprehension (MRC) use a pre-trained question-answer (QA) construction model to generate pseudo QA pairs MRC transfer. Such process will inevitably introduce mismatched (i.e., Noisy Correspondence) due i) the unavailable in target documents, and ii) shift during applying domain. Undoubtedly, noisy correspondence degenerate performance of MRC, which however is neglected by existing works. To solve such an untouched problem, we propose construct additionally using dialogue related as well new method MRC. Specifically, Robust Domain Adaptation Machine Reading Comprehension (RMRC) consists answer extractor (AE), question selector (QS), model. RMRC filters out irrelevant answers estimating correlation document via AE, extracts questions fusing candidate multiple rounds chats QS. With extracted pairs, fine-tuned provides feedback optimize QS through novel reinforced self-training method. Thanks optimization QS, our greatly alleviate problem caused shift. best knowledge, this could be first study reveal influence models show feasible solution achieve robustness against pairs. Extensive experiments on three datasets demonstrate effectiveness

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stochastic Answer Networks for Machine Reading Comprehension

We propose a simple yet robust stochastic answer network (SAN) that simulates multistep reasoning in machine reading comprehension. Compared to previous work such as ReasoNet, the unique feature is the use of a kind of stochastic prediction dropout on the answer module (final layer) of the neural network during the training. We show that this simple trick improves robustness and achieves result...

متن کامل

Evaluating Machine Reading Systems through Comprehension Tests

This paper describes a methodology for testing and evaluating the performance of Machine Reading systems through Question Answering and Reading Comprehension Tests. The methodology is being used in QA4MRE (QA for Machine Reading Evaluation), one of the labs of CLEF. We report here the conclusions and lessons learned after the first campaign in 2011.

متن کامل

Sample-oriented Domain Adaptation for Image Classification

Image processing is a method to perform some operations on an image, in order to get an enhanced image or to extract some useful information from it. The conventional image processing algorithms cannot perform well in scenarios where the training images (source domain) that are used to learn the model have a different distribution with test images (target domain). Also, many real world applicat...

متن کامل

Evaluation Metrics for Machine Reading Comprehension: Prerequisite Skills and Readability

Knowing the quality of reading comprehension (RC) datasets is important for the development of natural-language understanding systems. In this study, two classes of metrics were adopted for evaluating RC datasets: prerequisite skills and readability. We applied these classes to six existing datasets, including MCTest and SQuAD, and highlighted the characteristics of the datasets according to ea...

متن کامل

Evaluation Experiment for Reading Comprehension of Machine Translation Outputs

This paper proposes evaluation methods for reading comprehension of English to Japanese translation outputs. The methods were designed not only to evaluate the performance of current systems, but to evaluate the performance of future systems had the current problems been solved. The experiments have shown that the proposed methods are capable of producing results that are statistically signific...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i7.25974